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Abstract 

The construction of a spatio-temporal wavelet and its tuning to speed was first realized in the 
90s on the Morlet wavelet by M. Duval-Destin [11, 12]. This enabled to demonstrate the ca- 
pacities of the speed-tuned Morlet for psychovisual analysis. This construction was also used 
very efficiently in a powerful aerial target tracking algorithm by Mujica et al.[16, 17]. In the last 
decade, this tool was proposed as an elegant and efficient alternative framework to the Optical 
Flow (OF), the Block Matching (BM) or the phase difference, for the study of motion estimation 
in image sequences. Nevertheless, the aperture selectivity of the 2D+T Morlet wavelet presents 
some difficulties. Here we propose to replace the 2D Morlet wavelet by a Gaussian-Conical (GC) 
wavelet for the spatial part of the spatio-temporal wavelet, since the GC wavelet has a better 
aperture selectivity and allows a very simple adjustment of the aperture. Therefore we build 
a new, highly directional, speed-tuned wavelet called Gaussian-Conical-Morlet (GCM) wavelet. 
Like the speed-tuned 2D+T Morlet, the new wavelet presents very good characteristics in mo- 
tion estimation and tracking, namely long temporal dependence, robustness to noise and to 
occlusions, and supersedes the OF (Optical Flow) and BM (Block Matching) techniques. How- 
ever, for aperture selectivity, directional speed-capture and spectral recognition and tracking, 
GCM easily outperforms Morlet. This paper describes the GCM construction, utilization and 
aperture performances. 

Keywords: Extraction of motion parameters, speed tuning, continuous wavelet transform, 
directional and conical wavelets, angular and aperture selectivity 



1. Introduction 

The continuous wavelet transform has proved to be a very efficient tool for signal analysis. In 
the late 80s and the 90s, developments to adapt the wavelet transform to various motions have 
been proposed by M. Duval-Destin and R. Murenzi [12]. 

The group of analysis parameters, i.e., usually position, scale and rotation, has been extended 
to speed, acceleration and deformation. This has led to various types of time dependent wavelets 
[6, Chap. 10]. Then a very performant algorithm for missile tracking was set up by Mujica et al. 
[16, 17] using such wavelets (see also [6, Chap. 10]). In [13, 19], the authors use the same energy- 
density based algorithm, but with Expectation-Maximization plus Gaussian mixture approach 
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and scale functional relation plus ST processing blocks. Later, in [20], the second named authors 
also use image transformation to time-varying (ID+T) signals. 

The advantage of velocity detection with the motion-tuned spatio-temporal CWT over other 
known methods, like Optical Flow (OF) [7], Block Matching (BM) and phase difference has been 
already discussed in [8, 9]. These methods work on the motion of pixels or of blocks, but not 
on regions or objects. They assume that the object is constant from frame to frame and that 
the object signature does not change with time. They are not inherently scalable either. Like 
BM, OF has a short time dependence, which is not very accurate for slow motion or trajectory 
estimation. The four characteristics of object tracking, by spectral signature or spatial scale, 
long temporal dependence, robustness to noise and robustness to occlusions, are the strength of 
wavelet analysis, and we plan to show it further for pertinent feature extraction and recognition 
in sequence analysis, for object tracking, for video compression and for video data mining. 

Because of its compactness both in position space and in frequency space, but also because 
the symmetry of its envelope, the Morlet wavelet was first chosen to be tuned to speed. In 
Fourier space, this wavelet reads as 

ipM{k) = V~e[exp{-^\A-\k-ko)\^)-exp{-^\A-%\^) exp(-i|A-ifc|2)), (1.1) 

where A = diag[l, e~^/^], e ^ 1, is a 2 x 2 anisotropy matrix and the correction term, which 
ensures admissibility of the wavelet, is usually dropped (see [6, Eq(3.18)]), since it is negligible 
for practical values of the parameters. 

Nevertheless numerous difficulties remain when using this wavelet in directional analysis. 
Although it has a good capability for directional filtering, its aperture selectivity is poor. It 
is directional in the sense defined in [4] and [6, Sec. 3. 3], namely, ^A wavelet '0 is said to be 
directional if the effective support of its Fourier transform '0 is contained in a convex cone in 
spatial frequency space^\ but the anisotropy parameter e > 1 is needed in order to get a decent 
angular selectivity. In addition, the Morlet wavelet has a major drawback: its angular selectivity 
increases with the length of the wave vector /cq, since the support cone gets narrower, but at the 
same time the amplitude decreases as exp(— |A:op). 

In order to achieve a more efficient directional wavelet, a better method is to consider a smooth 
function with support in a strictly convex cone C and behaving inside this cone as P{k)e~^'^ 
where C ^ C and P(-) is a polynomial. This leads to the conical wavelets, in particular the 
Cauchy wavelet and the Gaussian-conical wavelet, if the exponential is replaced by a Gaussian 
[4, 6]. These are genuine directional wavelets that don't suffer from the defects of the Morlet 
wavelet. We have used these wavelets as a basis for a new construction of motion-tuned, and 
in particular speed-tuned wavelets. The development of these wavelets and their use in motion 
analysis is the aim of the present paper. 

2. Preliminaries: the 2D continuous WT 

In order to motivate our construction and to fix notations, we begin with a brief reminder of 
the 2D continuous wavelet transform (CWT), following [6, Chap. 2]. A 2D wavelet is a function 
^l) G L^(M^, dx) satisfying the admissibility condition 

djMj£<eo, (2.1) 

where '0 is the Fourier transform of '0. In practice, this condition is often replaced by the slightly 
weaker one 

^(0) = 0^ / df^(f) = 0. (2.2) 
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Given a 2D signal (an image) s G L^(R^), its CWT with respect to the wavelet is given by 
the inner product 

= a-^ / dxt/j{a-^r-%x-b)s{x), (2.3) 

JR2 



a / 

JR2 



d/ce^"^(ar-0(/c)5(/c), (2.4) 



where the overbar denotes complex conjugation, i^^^Q is a copy of '0 translated by 6 G M^, 
dilated by a factor a > 0, and rotated by an angle 9 G [0, 27r], that is, 

V'K,a,e = «"'V'(«-V^-'(^-^)), (2.5) 

where is the familiar 2x2 rotation matrix of angle 9 

cos 9 — sin 6> 
sin 9 cos 9 

Inverting the transform, we obtain the reconstruction formula 

«(^) ^c-' JJJ db^dO i'ji^^/x) W^s{b, a, 6). (2.6) 

These formulas will be extended to the 2D+T case in Section 4.2 below. 
Next comes the choice of wavelet. Two possibilities are available: 

(i) Isotropic wavelets^ i.e., rotation invariant wavelets, well adapted for a pointwise analysis; 

(ii) Anisotropic wavelets: when the aim is to detect oriented features in an image, which is the 
case here, one has to use a wavelet which is not rotation invariant. The best angular selectivity 
will be obtained if '0 is directional [3, 4], which means that the (numerical) support of ifj in 
spatial frequency space is contained in a convex cone with apex at the origin (that is, that the 
wavelet is numerically negligible outside the cone). When this condition is not respected, as in 
the anisotropic Mexican hat wavelet, whose 'footprint' is an ellipse centered at the origin, the 
directionality of the wavelet becomes very poor. A typical directional wavelet is the 2D Morlet 
wavelet (1.1). 



3. Conical wavelets 

However, some directional wavelets (e.g. Morlet) have a good capability of directional filtering, 
but their angular selectivity remains poor. Thus was introduced the concept of a conical wavelet 
[5], which generalizes the ID Cauchy wavelet of Paul (see [4, Sec. 2.1]), namely, 

7 . . / 0, for < 0, , . 

In ID, the positive half-line is a convex cone. Thus a natural generalization to 2D (and in fact 
to nD) will be a smooth function "ip^^k)^ with support included in a strictly convex cone C with 
apex at the origin, and which behaves inside this cone as P(A:i, A;2, /c^)e~^-^, where C ^ C and 
P is an n- variable polynomia 

By analogy with (3.1), the radial behavior in e~^-^ of this conical wavelet gives it its specific 
name of Cauchy wavelet. In order to get a better radial localization, the slowly decreasing 
exponential term can be replaced by a Gaussian in kx [18, 5]. We will use the term Gaussian- 
Conical for this specific case, that we will discuss in Section 3.2. 
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3.1. The 2D Cauchy wavelet 

We take first the 2D Cauchy wavelet, following [6, Sec. 3.3.4]. For simplicity, we consider a 
strictly convex cone, symmetric with respect to the positive fc^^-axis, namely 

C := C{-a, a) = {fc G I - a < arg < a, a < 7r/2}, 

that is, the convex cone determined by the unit vectors e-a^Sa- The dual cone, with sides 
perpendicular to those of the first one, is also convex and reads: 

C = C(-5, a)^{k^B?\k-k' V fc' G C{-a, a)}, 

where a — —a + 7r/2. Therefore e_Q, • e q, = e q, • e_Q, = 0. Thus a is the aperture of the cone C, 
5 the aperture of the dual cone 

Given the fixed vector rf = (77, 0), 77 > 0, along the axis of the cone, we first define the Cauchy 
wavelet in spatial frequency variables: 

ijig(k) = I • • ^-^^"^ ^ ^ ^) (3.2) 

[ 0, otherwise. 

The Cauchy wavelet iljig{k) is strictly supported in the cone C(— a, a) and the parameters 
l,m ^ N*, l,m ^ 1, give the number of vanishing moments of '0 on the edges of the cone, and 
thus control the regularity of the wavelet. One mostly uses the symmetric version of the wavelet, 
with / = m. The advantage of this wavelet is that its expression in x-space can be computed 
analytically [6, Eq.(3.29)], but we won't need it in the sequel. 

We note in passing that the Cauchy wavelets have minimal uncertainty, in the sense that they 
minimize the product of the variances (uncertainties) of a pair of noncommuting elements of the 
underlying Lie algebra of the similitude group. This is the exact equivalent of the usual minimal 
uncertainty relation ApAx ^ h/2 familiar in quantum mechanics, or AB{(jj)Ax{t) ^ 1, for a 
signal, with B{(jj) the bandwidth. We refer to [6, Sec. 8.2] for a discussion of this topic 

3.2. The 2D Gaussian- Conical wavelet (GC) 

By definition, the 2D Cauchy wavelet has the property that its opening angle a is totally 
controllable [6], independently of the amplitude, thus it avoids the drawback of Morlet men- 
tioned above. Nevertheless, although it has a good angular selectivity^ its radial selectivity is 
poor because the exponential term decays slowly as |A:| ^ oc. This is why this exponential is 
often replaced by a Gaussian along kx^ which concentrates the wavelet on its central frequency 
{\/l + m, 0) [18, 5, 14]. Then a > controls the scale localization of the Gaussian. A center 
correction term^ x(^) — Vl + tu^^^ controls the radial support of '0. Thus we obtain the 
expression of the Gaussian-Conical or GC wavelet, in frequency space [6, Eq.(3.37)]: 

- I 0, otherwise. ^^'^^ 
The GC wavelet is shown in Fig. 1. 

4. The new 2D+T, motion-tuned, Gaussian-Conical-Morlet (GCM) wavelet 

4-1- Construction of the GCM wavelet 

Building a wavelet for directional velocity analysis, in a sequence of images, starts by building 
a spatio-temporal 2D+T wavelet. Our purpose is to analyze motions in very precise directions 
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Figure 1: The 2D Gaussian-Conical filter in the (kx^ky) plane: the "shark" wavelet; side and top views. 
The parameters are a = 10°, / = m = 4. 



and with easy adjustment of this direction. Because the 2D Gaussian conical filter has good 
capacities in spatial resolution and orientation, as well as aperture selectivity, it has been chosen 
for the spatial component of new velocity-tuned wavelet. 

As for the ID temporal direction, using a conical wavelet makes no sense for velocity tuning, 
since a ID convex cone is a half-line, as in (3.1). On the other hand, with the Duval-Destin- 
Murenzi (DDM) wavelets [11, 12], the Morlet wavelet has demonstrated its ability to be shaped 
and elongated both in the spatial and the temporal directions, thus ensuring the adaptation to 
a psychovisual behaviour as well as the tuning to speed. Because of its Gaussian shape, this 
wavelet behaves like an ellipsoid. Elongating this ellipsoid in the vertical or horizontal direction 
(i.e., along kt or kx) is easily done by means of the speed-tuning operator introduced in Section 
4.2. Furthermore, the displacement along an hyperboloid of velocities ensures speed capture by 
"catching" the inclination of the Fourier spectrum of a signal for a very specific speed, thus the 
speed of this signal. And the behavior of the Morlet ellipsoid with speed tuning perfectly fits 
the slope of the spectrum: a horizontal (in the k direction) ellipsoid for low speeds and a vertical 
(in the uj direction) ellipsoid for high speeds. For this reason we make the choice of a Morlet 
filter along the temporal wave-vector axis. 

As for the DDM case, the resulting 2D+T wavelet is constructed in a separable way in 
frequency space: the velocity-tuned 2D+T filter is simply the product of the 2D Gaussian- 
Conical wavelet with the ID Morlet wavelet. We call the resulting wavelet 2D+T Gaussian- 
Conical- Morlet (2D+T GCM, or simply GCM). It is given in Fourier space as 

ky) ■ ^^{uj) , k e C(-a, a), 



2D Gaussian-Conical ID Morlet (4-1) 

, 0, otherwise. 



with — e ^o)^ and cjq the central frequency of the Morlet wavelet. Explicitly, the 

expression of GCM, in Fourier, is given by 

^l^ (^''^^-\ 0, otherwise. ^^'^^ 

Note that more sophisticated spatio-temporal wavelets may be designed. They are adapted 
to a specific relativity group (Galilei, Poincare) and are derived from a representation of the 
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latter [1, Sec. 15.3]. For simplicity, however, and for the sake of comparison with previous works 
on motion analysis, we restrict ourselves to the separable case. 

Correspondingly, we consider (2+l)-dimensional signals s (image sequences) of finite energy 
s G L2(M2 X M, df dt): 

^'^ - I I df dt < oc. (4.3) 



The Fourier transform of s is defined, as usual, by 



?(fc, uj) = (2^)-3/2 / / df dt e-^(^-^+^^)5(f , t). (4.4) 



where k is the spatial frequency and u is the temporal frequency. 

4-2. Tuning the GCM wavelet to motions 

Several motion operators can be applied to the "mother" wavelet ^, namely, scaling (L)), 
translation (T), rotation (i?) and speed tuning (A). The action of these operators on the wavelet 
is given by 

ij]{k,uj) = asa]'^i^{ask,atuj), 
[f^'^ i^]{k,uj) = e-^(^-^+^^)^(fc,cj), (4.5) 
[R'i^]{k,uj)^i^{T-'k,uj), 
[A^^](fc,cc;) = ^(c^fc, c-^u). 

As explained in [6, Sec. 10. 3], these operations form a group, with group parameters g = 
{6, T, ^; tts, ttf, c}. Notice that we consider here different scaling parameters in space and time, 
as and at, respectively. Applying these operations on the wavelet '0, we obtain the dilated, 
speed tuned wavelet denoted i^^^Q.^ ^ ^- In terms of the latter, we may now define the 2D+T 
continuous WT of the signal s E L?{M? x R, dx dt) as 

W^s{h, T, e- as, at, c) = (^g^^^^.^^^^^^, I s) 



-1 -1/2 

^ a, a+ ' 



J J dk doj e-^(^-^+^^) ^(a,c^ r-^k, atC-Pu) s{k, u). (4.6) 



Like for the speed-tuned Morlet wavelet, the application of these operators to the GCM 
wavelet is done separately on the spatial 2D conical filter and the temporal ID Morlet filter, 
depending on the temporal or the spatial nature of the transformation. In particular, the speed 
tuning is realized by the deformation of the space-time domain with speed. The spatial wave- 
number (or position) is multiplied by a factor proportional to the speed, and the temporal 
wave-number is multiplied by a factor inversely proportional to the speed. These factors are 
adjusted by the two speed exponents p = 2/3 and q = 1/3 whose values are fixed by the 
constraints on the transformation A*^ [12]. 

After implementation of the transforms defined in (4.5), we can give the expression of the 
2D+T GCM in terms of the group parameters g = {6, r, 9; as, at, c}: 



X expf-Ka^c^/c^; - x(c^))^] exp[-i(c"^atCc; - cjq)^], k G C{-a, a), 
0, otherwise. 

(4.7) 



6 



1 20 








1 








1 


^^^^ 






1 








1 








1 20 40 


60 80 


100 


120 



Speed-tuned GCM: apert = pi/16, sigma= 1, moments(l,m,n) = [101010], speed = 1, 







20 40 60 80 100 120 




Figure 2: The 2D+T Gaussian-Conical-Morlet (GCM) wavelet tuned to speed: (a) Top view in the plane 
{kx^ ky) for c = 1 (i.e. this is the mother wavelet) and aperture a = 7r/16; (b) Lateral view of the same in 
the plane (kx^oj)] (c) 3D view of the GCM envelope in Fourier space {kx^ ky^uo) for c = 1; (d) 3D sectional 
views in {kx^ky^uo) showing the Gaussian amplitude behaviour on the (vertical) Morlet part, as well as 
on the (horizontal) conical part, and for c = 2. 



The 2D+T GCM wavelet tuned to speed c = 1, i.e., the mother wavelet for speed analysis, is 
shown in Fourier space {kxi ^y?^) Fig. 2 (and its sectional view at c — 2, for sake of clarity). 
In Fig. 3, we show the same mother GCM wavelet together with speed-tunings at low (c = 0.4) 
and high (c = 4) speeds. This figure perfectly describes the similarity of psychovisual behavior, 
with speed variations, between GCM and 2D+T Morlet. 

From here on, our notation will be the following: The wavelet tuning speed is c, the speed of 
the real object in the sequence is Vr and i;^ will be the speed measured, on the energy curve, 
by the speed-tuned wavelet family. 

4-3. Central frequency and speed capture initialization 

The GCM wavelet can be transformed to a simple low-pass filter by centering the wavelet in 
the Fourier plane. This is done by translating the wave- vector k of the conical component by the 
value fco (see below) of its central frequency, and by cancelling the term cjq for the Morlet part. 
With this condition, the conical and the Morlet wavelets do not oscillate any more, in fact they 
are no longer wavelets, but simply filters. They also cannot be tuned to scale. The advantage 
is that all the "GCM speed-tuned filters" so obtained are centered in the Fourier plane and can 
easily capture the initial speed, due to their shape, independently of the object scale. The value 
of the new central wave-vector of our speed-tuned 2D+T GCM can simply be deduced from 
the static version. The GCM wavelet has been constructed with axis f] in the same direction as 
kx- Thus the "static" central wave- vector corresponds to a maximum of the wavelet along the 
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Figure 3: The "progressive" wavelet GCM tuned to various velocities with 3D view (left), side view 
(center) in {kx^oj) and top view (right) in {kx^ ky): (a)(b)(c) This is the mother wavelet for speed analysis, 
i.e. with velocity tuning c = 1 (shown here for sake of comparison with two other velocity tunings below). 
The other parameters are: a = 1; / = m = 10; = = 1; a = 7r/16;0 = 0; (d)(e)(f) c = 0.4 pix/fr, 
(7 = 2; (g)(h)(i) c = 4 pix/fr, a = 2. We notice on (f) and (i) how the (spatial) frequency size is reduced 
with speed. This demonstrates the good psychovisual behavior of wavelets with speed. We know that the 
higher the speed the larger the pattern size must be. Here, the wavelet proportionally shrinks in Fourier 
space (thus widens in position space) with an increase of speed, in order to capture higher speeds. 



kx axis. A derivation of ^^^^ leads in a straightforward manner to the following value of the 
"static" central wave- vector: kx^ = \/l + m [14]. If we remark that, in the speed-tuned GCM, 
the k vector is multiplied by c^, we can thus write the central wave- vector for speed-tuning 



m 



(4.8) 



Because the value of /cq is also affected by all the transformations that the wavelet undergoes, we 
thus compensate the conical wave- vector k by the central frequency computed for all transforms, 
which gives 

1 1 VTTm, 



ko 



-{cos 9, sini 



(4.9) 
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The expression of GCM centered in Fourier space becomes 

' a7ia7^/^e-*(^-^+'^^) {ciasr-\k - ko) • eaY {ciasr-\k - ko) • e-a)*" 

X exp[-f ((fc^ - k^) - X(o-))^] exp[-^{c-Patiuj - ujq))^], k G C{-a, a), 
0, otherwise 

(4.10) 

where, as before, the group parameters are g = {6, r, 9; a^, a^, c}. 

4-4- Angular resolving power 

If the wavelet ^ has its effective support, in spatial frequency, in a vertical cone of aperture 
A(/), corresponding to ko = (0, fco), the wavelet is considered as concentrated in an ellipse. Its 
angular resolving power (ARP) is defined by considering the tangents to that ellipse [6, Sec. 
3.4.1]. For Morlet, and for ko » 1, this gives 

ARP{^^) = 2cot-\/coV^) (4.11) 

which means that the angular resolving power of Morlet only depends on the product koy/e, a 
result first proposed in [2]. For the symmetric Cauchy or the conical wavelet in general, with 
support in the cone C{—a,a), the ARP is simply its opening angle, or aperture, 2a. 

4-5. Admissibility, frame bounds and reconstruction 

Frame theory addresses the issue of reconstructing a signal s{t) from sample coefficients of 
a particular transform. In general, a family tp^ in L'^{X) is a frame if there exist A > and 
B < oo such that for, all s G L'^{X) : 

n 

where A and B are the frame bounds. 

Most reconstruction algorithms from frame coefficients are of the recursive type or require the 
calculation of a dual frame, recursive as well. These algorithms generally require that the frame 
bounds be known by advance. The convergence rate also depends on the ratio B/A which is 
an important issue in the use of the spatio-temporal CWT transform; as is known [10, 15], the 
closer B/A to 1, the faster the convergence. We are interested here in knowing the values of the 
parameters that yield fast convergence with the GCM CWT transform. In ID, estimates for 
the frame bounds have been given by Daubechies [10, Proposition 3.3.2]. The same problem has 
been studied by Murenzi in the 2D case (see [6, Section 2.4.2]) and by Mujica for the 2D+T case 
of DDM wavelets [16]. We briefly recall some important steps of this issue and the conditions 
under which the family considered is a frame, presents a stable reconstruction and provides fast 
convergence rates. 

The first step is to discretize all wavelet parameters. Let us put 

a = ttQ, c = Cq , = qOo, ag, cq > 1, = vr/gi, with qi G ZJ^n^q ^ Z. 
Then, one defines the following quantities: 

A(fc,a;)= J2 |V'(4c^/'r-''o'^fc, a[,c-'"/'c^)|2, (4.13) 

r(6,r) = sup J2 \McTr-''°k), {a'oco'''^'co)\' |V'(4c^/V<?''°(fc - 6), ^Co^^/^c. - r))|, 

(4.14) 



E 

mx,my,pe1 
/(0,0,0) 



r('Umx,my,fp)r( ^mx,my,-fp)^ (4.15) 
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Figure 4: Comparison of aperture selectivity tuning between Morlet and GCM. The figure shows that 
both wavelets are easily rotated. But, on panel (a), the Morlet wavelet behaviour, w.r.t. a tuning of its 
central wave- vector /cq, is awkward : a change of ko induces a change in its opening angle a, but the 
wavelet moves on its radius with /cq, then its radial extension considers a different frequency bandwidth 
each time. On the contrary, panel (b) shows that on GCM, a variation of the aperture does not imply 
any displacement on its radius and that the radial extension is constant and easily adjustable. This is a 
major strength of GCM : its radial stability, with a variation in aperture, and its ability to be radially 
adjusted, which are very important for object recognition and tracking by spectral signature. For sake of 
clarity we show the result for several rotations of GCM. 



with Ur, 



27Tmx '^TTrUy 

bxQ byQ J 



, = and 6:^0, 6^0, TO > 



With these quantities, the extension of Daubechies's result to 2D+T frame bounds becomes 

Proposition 4.1 (Estimates of the frame bounds for the 2D-I-T case). If ipd, <^0; cq ctnd 9q are 
such that : 

A_ = inf A(fc,cj) > 0, A+ = supA(fc,cj) < oc, (4.16) 



'XQ < bxQ^, < byQ < byQ^, < To < To*. Moreover, the following are frame 



and ifr{b,r) decays fast enough, then there exists 6^0*; 
frame for all < 
bounds for tj^d - 

■(A- -7), B^ l ' (A+ + 7), 



^2/0*^ "^0* > such that represents a 



A 



bxobyo^O 



bxobyoTo 



(4.17) 



With an appropriate choice of ao, cq and Oq, the values of the infimum and supremum A_ 
and A+, the bounds A and B can be close to each other. The parameters bxQ, by^ and To are 
also adjusted so that 7 is close to 0. Then the ratio B/A can be close to one, thus resulting 
in a faster convergence rate of the reconstruction algorithm. Nevertheless, finding the infimum 
and supremum over the whole frequency space (k^u) implies difficult calculations. This is why 
a constraint or periodicity is introduced for the functions in (4.16) in order to find the values of 
the extrema A_ and A+. This periodicity can be found by defining a volumetric period and we 
refer to [16, Eq.(4.60) and Table 4.4] for the definition of this volumetric period and the values 
of the frame bounds A and B for different values of the discretization parameters ao, cq and 
^0- The resolution grid of the speed cq and orientation ^o parameters must also be tuned to the 
GCM wavelet, but this is planned for a future work. 
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4-6. Analysis algorithm 

For the analysis, we have to discretize the continuous CWT given in (4.6). This means, given 
a sequence of N frames, corresponding the time variables r^, i = 1, . . . , A^, we take the product 
of the whole sequence FFT with each speed-tuned wavelet FFT, then we apply an inverse 
FFT. This yields the wavelet transform of the whole sequence, W^(b^Ti,9;as^at^Cj), with all 
the parameters suitably discretized, in particular c = cj and r — Ti. We then compute the 
energy density of the z*^ frame |yV^(cj, Ti)p, taken as a function of speed Cj only, all the other 
parameters being fixed. Then, selecting a frame or a group of A^o frames among the N frames 
of the sequence, we compute the total energy, £^tot(cj), of the pixels of the selected frames A^o- 

Etot{cj) = 5^ 5^|W^(6^^,c„T,)p. (4.18) 

In this relation, hnm denotes a discretized version of the 6-plane, the summation running over all 
pixels of the frame. Note this expression is analogous to the speed-orientation density described 
in [17, Eq.(40)] or [6, Eq.(10.60)], except that here we sum over all pixels of the frame (i.e., 
B — R^), because we want a global energy density. 

Then we study the curve f{cj) \— Etot{cj). This curve will go through a maximum, 
when the speed of the tuned wavelet cj matches the real speed Vr of the object (a travelling 2D 
Gaussian here), see an example in Fig. 5(b). Several algorithms could be used to optimize the 
search for the maximum of the energy. This could be done, for example, by using a dichotomy 
rather than sweeping the whole range of velocity tunings, or by using the Nelder-Mead algorithm 
as in [17]. 

5. Experimentation 

We first recall the results obtained in speed computation [9] between the Morlet speed-tuned 
CWT and the Optical Flow: 

Total computation speed (Xeon bi-processor at 2.4 gHz): 

(1) MTSTWT with 3 wavelets tuned to 3 speeds (3,6,10 pixels/fr) on a 360 x 240 x 8 sequence 
block (Tennis sequence) at the highest resolution of scale: ^MTSTWT (1200ms) + 3 x IFFT3D{3 x 
380 ms) 2.4 s. 

(2) Fast Optical Flow with wavelets [7] between two frames and 4 different resolutions : toF 
=10 s. 

In Fig. 4, we show the orientations that the Morlet and the GCM wavelets can take for 
0^9^ +7r/2. This figure speaks by itself. Morlet is shown with three orientations. In fact this 
figure illustrates the major problem of aperture adjustment with the Morlet wavelet : its opening 
angle is mainly trimmed through a variation of its central wave- vector ko. And the larger the 
value of fco, the smaller the aperture of the directional Morlet. But the major drawback of this 
adjustment is that the frequency bandwidth that Morlet can capture also increases with the 
value of ko. And because this bandwidth corresponds to the spectral signature of the moving 
object, it can become difficult to adjust the aperture of the filter during object tracking. The 
effect of the anisotropy parameter e does not have this drawback. But its effect is relatively 
limited to a variation of the aspect-ratio of the wavelet along a specific direction. Here, we have 
also modified (increased) the value of e in the ky direction (thus resulting in a elongation in 
the kx direction). The displacement with ko is large enough to see, on the same radius, all the 
shapes that Morlet takes if we increase the couple (A:o,e). We show 3 couples ko = {6, 12,22} 
and e = {1,2,8}. For small values of ko and/or e, the aperture is very large. This aperture 
decreases by increasing ko and e. But as ko increases, the Morlet wavelet moves away from the 



11 




Figure 5: Demonstration of speed capture in the Fourier domain with Vr = 3. (a) The spectrum of the 
sequence "travehing Gaussian" (in red) intersects the speed-tuned GCM wavelets hyperbola-hke family 
(in blue), (b) The energy sum plotted w.r.t wavelet speed tuning (c). The maximum is exactly reached 
for c = Vr = 3 pixels/fr. 



Fourier center (0,0), along its radius. This makes very difficult to adjust the spatial positioning 
of this wavelet w.r.t. a change in aperture selectivity. 

With GCM, on the contrary, the couple orientation/aperture is extremely simple to adjust. 
We show 5 orientations 6> = {0 to 7r/2} with GCM tuned to aperture a = {7r/256 to tt/IG}. This 
proves the very good aperture selectivity of GCM. But it also proves its very good stability in 
radial expansion and position with a variation of aperture. These last characteristics, together 
with the adjustability of GCM in radial expansion and position, prove the real superiority of 
GCM on the 2D+T Morlet for object recognition and tracking by spectral signature. 
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Figure 6: Result : Performances of GCM in directional speed capture: we plot the curve Vm vs. O^av for 
— 7r/2 ^ 6> ^ +7r/2 for an aperture a = n/W and '^^ = 3. The correct speed is captured when the wavelet 
orientation {0 = 0) exactly corresponds to the spectrum orientation (OX). 

We now perform a comparison between the 2D+T Morlet and the 2D+T GCM wavelets. We 
use a test sequence of 128 x 128 x 32, that includes the motion, at constant speed, of a 2D 
non-symmetrical Gaussian. The angles of the Gaussian and of its trajectory are varied (along 
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ox, at 45° and along OY). 

The first experiments liave been done witii an isotropic Gaussian travelling at constant speed 
in the plane. We checked that speed capture is as good with GCM as with Morlet. But this is 
not the real interest of this study. We then modified the Gaussian to exhibit a strong anisotropy 
in the direction OY in the spatial plane. For this we took a large ay (i.e., in the OY direction) 
and a small ax- The larger a^, the narrower its spectrum along the kx direction in the Fourier 
domain. Thus we are able to test the angular, or more exactly, the aperture selectivity of the 
wavelet, i.e. its accuracy in directional speed capture. The orientation of the object will also be 
its spectral "signature" and will enable to "capture" it in a sequence and to assign it its speed 
(Fig. 5). 

In the 3D Fig. 5 (a), we can observe the spectrum of the travelling Gaussian, oriented along 
OX (OY in the direct space) and of large a^, and travelling (along OX in the direct space) at a 
speed Vr = 3 pixels/fr. The figure shows the "interception" of the spectrum by the hyperbola- like 
family of GCM wavelets tuned to speeds between 1 and 6 pixels/fr. The aperture is a = tt/IG 
and the spatial and temporal scales are = 3 and at = 3. By looking again at Fig. 4, panel 
(a), it is obvious that in the same conditions, a change of aperture of the Morlet wavelet would 
result in a displacement on its radius, thus in a change of spatial position and scale. The poor 
angular selectivity of Morlet, together with speed-tuning, thus makes the directional capture of 
the spectrum very difficult. This does not happen with GCM. This makes it very efficient in 
capturing the signal speed at any angle within a constant narrow conical aperture a = tt/IG 
and not outside it. We will also show further that GCM can achieve much narrower apertures 
in speed-capture. 

In Fig. G, we plot the curve Vm vs. O^av for —7t/2 ^ 9 ^ +7r/2, with GCM. The correct 
speed is captured when the wavelet orientation (0 = 0) exactly corresponds to the spectrum 
orientation (OX). This proves the good angular selectivity of GCM, that could not be reached 
with Morlet, and its efficiency to detect the correct speed of the sequence in a very narrow 
angular aperture and not elsewhere. 

The final Fig. 7 demonstrates the excellent performance of GCM during a speed cap- 
ture at i;^ = 4 and for increasingly narrow conical apertures. We have taken apertures of 
7r/8, tt/IG, 7r/G4, 7r/25G (0.70 deg). The capture conditions are those of Fig. 5, where the spec- 
trum, oriented in the OX direction, is narrow, and for a real speed Vr = 4. GCM captures the 
spectrum for extremely weak apertures without moving in the Fourier space. This makes it very 
robust to scale initialization and object size detection and tracking, which is one a the major 
drawbacks of speed capture with Morlet. 

6. Conclusion 

In this paper, we have introduced a new tool, based on the redundant CWT, for directional 
speed analysis in video sequences, namely, a new speed-tuned wavelet, the Gaussian-Conical- 
Morlet (GCM). It is based on a 2D Gaussian-Conical genuine directional wavelet, on one hand, 
together with a temporal ID Morlet wavelet, on the other hand, and it provides a very high 
directionally selective speed-tuned wavelet. This substantially improves the characteristics of 
the 2D+T Morlet wavelet previously used in spatio-temporal speed-based approaches of motion 
estimation and tracking. It thus reinforces the efficiency of the CWT tool for speed and motion 
tracking, and in particular its inherent robustness to noise, occlusions and local illumination 
variations as well as its efficiency to long dependence analysis. In this work we have proved 
the extreme efficiency of this tool in directional speed selectivity (the aperture of the conical 
wavelet) down to angle apertures of less than 1 degree (7r/25G) but also its capacities of radial 
stability and adjustment, w.r.t. aperture variation, that make it a much more powerful tool 
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Figure 7: This figure shows a speed detection for an object speed Vr = 4, hke in Fig. 5(b), but for a 
more and more narrow wavelet aperture a, respectively (from (a) to {d)) : tt/S, 7r/16, 7r/64, 7r/256. Here the 
GCM parameters are a = 1, / = m = 10, /co = 6 and as — at — 3. We observe a very good capacity of capture for 
a very narrow aperture (7r/256 = 0.70 deg). 



than the 2D+T Morlet wavelet for spectral signature recognition and tracking. An initialization 
of the speed capture has also been proposed by turning the GCM wavelet to a simple low-pass 
directional filter centered in the middle of the Fourier plane. These two contributions will be 
pursued with, among other topics, applications in the domain of object and motion tracking in 
video sequences and in the domain of feature retrieval. 
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